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We consider communication over Automatic Repeat reQuest (ARQ) memoryless channels 
with deadlines. In particular, an upper bound L is imposed on the maximum number of ARQ 
transmission rounds. In this setup, it is shown that incremental redundancy ARQ outperforms 
Forney's memoryless decoding in terms of the achievable error exponents. 
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Abstract 



In [1], Burnashev characterized the maximum error exponent achievable over discrete memoryless 
channels (DMCs) in the presence of perfect output feedback. Interestingly, Forney has shown 
that even one-bit feedback increases the error exponent significantly [2]. More specifically, Forney 
proposed a memoryless decoding scheme, based on the erasure decoding principle, which achieves a 
significantly higher error exponent than that achievable through maximum likelihood (ML) decoding 
without feedback [3]. In Forney's scheme, the transmitter sends codewords of block length N. After 
receiving each block of N symbols, the receiver uses a reliability-based erasure decoder and feeds 
back one ACK/NACK bit indicating whether it has accepted/erased the received block, respectively. 
If the transmitter receives a NACK message, it then re-transmits the same A^-symbol codeword. 
After each transmission round, the receiver attempts to decode the message using only the latest 



N received symbols, and discards the symbols received previously. This process is repeated until 
the receiver decides to accept the latest received block and transmits an ACK message back to the 
transmitter. 

It is intuitive to expect a better performance from schemes that do not allow for discarding 
the previous observations at the decoder, as compared with memoryless decoding. Our work here 
is concerned with one variant of such schemes, i.e., Incremental Redundancy Automatic Repeat 
reQuest (IR- ARQ) [4]. We further impose a deadline constraint in the form of an upper bound L 
on the maximum number of ARQ rounds. In the asymptotic case L —>■ oo, we argue that IR-ARQ 
achieves the same error exponent as memoryless decoding, denoted by Ep(R)- On the other hand, 
for finite values of L, it is shown that IR-ARQ generally outperforms memoryless decoding, in terms 
of the achievable error exponents (especially at high rates and/or small values of L). For example, 
we show that L = 4 is enough for IR-ARQ to achieve Ep(R) for any binary symmetric channel 
(BSC), whereas the performance of memoryless decoding falls significantly short from this limit. 

*The authors are with the ECE Dept., The Ohio State University, Emails: {gopalap,namy,helgamal}@ece. osu.edu. 
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The rest of this correspondence is organized as follows. In Section 2, we briefly review the 
memoryless decoding scheme without any delay constraints, and argue that allowing for memory 
in decoding does not improve the error exponent. The performance of the memoryless decoder and 
the incremental redundancy scheme, under the deadline constraint, is characterized in Section 3. 
In Section 4, we consider specific channels (like the BSC, VNC and AWGN channels) and quantify 
the performance improvement achieved by incremental redundancy transmission. Finally, some 
concluding remarks are offered in Section 5. 



2 The ARQ Channel 

We first give a brief overview of the memoryless decoding scheme proposed by Forney in [2]. The 
transmitter sends a codeword x m of length N, where m G {1, • • • , M}. Here M represents the total 
number of messages at the transmitter, each of which is assumed to be equally likely. The trans- 
mitted codeword reaches the receiver after passing through a memoryless channel with transition 
probability p(y\x). We denote the received sequence as y. The receiver uses an erasure decoder 
which decides that the transmitted codeword was x m iff y G TZ m , where 

n m =\y: > e-l (1) 

where T > is a controllable threshold parameter. If (1) is not satisfied for any m G {1, • • • , M}, 
then the receiver declares an erasure and sends a NACK bit back to the transmitter. On receiving 
a NACK bit, the transmitter repeats the codeword corresponding to the same information message. 
We call such a retransmission as an ARQ round. The decoder discards the earlier received sequence 
and uses only the latest received sequence of N symbols for decoding (memoryless decoding). It 
again applies the condition (1) on the newly received sequence and again asks for a retransmission 
in the case of an erasure. When the decoder does not declare an erasure, the receiver transmits 
an ACK bit back to the transmitter, and the transmitter starts sending the next message. It is 
evident that this scheme allows for an infinite number of ARQ rounds. This scheme can also be 
implemented using only one bit of feedback (per codeword) by asking the receiver to only send back 
ACK bits, and asking the transmitter to keep repeating continuously until it receives an ACK bit. 
Since the number of needed ARQ rounds for the transmission of a particular message is a random 
variable, we define the error exponent of this scheme as follows. 

Definition 1 The error exponent E(R) of a variable-length coding scheme is defined as 

E{R) = limsup - l0g ! r(E) , (2) 

AT->-oc T 

where Pr(E) denotes the average probability of error, R denotes the average transmission rate, and 
f = (In M/R) is the average decoding delay of the scheme, when codewords of block length N are 
used in each ARQ transmission round. 

The probability of error of the decoder in (1), after each ARQ round, is given by [2] 
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and the probability of erasure is given by 

\ m y£R m J 
It is shown in [2] that these probabilities satisfy 

Pr(X) < e -^i(«i-T) an d p r ( e ) < e -"MRi,T) ? (3) 

where i?i = (mM/TV) denotes the rate of the first transmission round, 

E 2 (R U T) = E^Tj+T, (4) 
and Ei(Ri,T) is given at high rates by [2] 

E 1 (R 1 ,T) = max E (s, p, p) - pR ± - sT, (5) 

0<«<p<l,P 



E (s,p,p) = -\og J (^J p(x)p(y\x) {1 s) dx^j (^J p(x)p(y\x) {s/p) dx^j dy , 



(6) 



and at low rates by 



E^RuT) = n max E x (s, p, p) - pR 1 - sT, (7) 



E x (s,p,p) = -p log J J p(x)p(x 1 ) (^J p{y\x) {1 s) p(y|xi) s dy^ dx dxi , 



(8) 



where p = {p(x),Wx} denotes the input probability distribution (We note that for discrete mem- 
oryless channels, the integrals in (6) and (8) are replaced by summations). The average decoding 
delay r of the memoryless decoding scheme is given by 

00 

r = kN Pr (Transmission stops after k ARQ rounds) 
k=i 

00 N 
Y,kN [Pr(X)]( fc - 1 )[l-Pr(X)] = { _ — , 



N 

kN [Pr(A)j VK " ±; [l - Pr(A)j = 

k=l 

which implies that the average effective transmission rate is given by 



R= ^M = («) I1 -P lM . 8l [ 1 -P, M 



(9) 



It is clear from (3) and (9) that R — > Ri as N — > 00 if E 1 (R 1 ,T) > 0. The overall average 
probability of error can be now computed as 

00 

Pr(E) = ^[Pr(X)]^ 1 ) Pr(e) = Pr(e) [1 + o(l)] , (10) 
k=i 



where the second equality follows from (3) when Ei(Ri,T) > 0. It is, therefore, clear that the error 
exponent achieved by the memoryless decoding scheme is 

m = , imsup _igg^a±f(i)]) > EM 

N-^oo T 

It is shown in [2] that choosing the threshold T such that E\(R\,T) — > maximizes the exponent 
E 2 (Ri,T) while ensuring that R — > i?i as AT — > oo. This establishes the fact that the memoryless 
decoding scheme achieves the feedback error exponent E F (R) defined as 

E F (R) = lim E 2 (R,T) = lim T . (11) 

At this point, it is interesting to investigate whether a better error exponent can be achieved 
by employing more complex receivers which exploit observations from previous ARQ rounds in 
decoding (instead of discarding such observations as in memoryless decoding). Unfortunately, it is 
easy to see that this additional complexity does not yield a better exponent in the original setup 
considered by Forney [2]. The reason is that, as shown in (10), the overall probability of error in 
this setup is dominated by the probability of error Pr(e) in the first transmission round. So, while 
our more complex decoding rule might improve the probability of error after subsequent rounds, 
this improvement does not translate into a better error exponent. In the following section, however, 
we show that in scenarios where a strict deadline is imposed on the maximum number of feedback 
rounds, significant gains in the error exponent can be reaped by properly exploiting the received 
observations from previous ARQ rounds (along with the appropriate encoding strategy). 



3 ARQ with a Deadline 

In many practical systems, it is customary to impose an upper bound L on the maximum number of 
ARQ rounds (in our notation, L > 2 since we include the first round of transmission in the count) . 
Such a constraint can be interpreted as a constraint on the maximum allowed decoding delay or 
a deadline constraint. With this constraint, it is obvious that the decoder can no longer use the 
rule in (1) during the L th ARQ round. Therefore, after the L th round, the decoder employs the 
maximum likelihood (ML) decoding rule to decide on the transmitted codeword. We denote the 
probability of error of the ML decoder by Pr (Mi) (e). 



3.1 Memoryless Decoding 

The following theorem characterizes lower and upper bounds on the error exponent achieved by the 
memoryless decoding scheme, under the deadline constraint L. 

Theorem 2 The error exponent E MD (R,L) achieved by memoryless decoding, under a deadline 
constraint L, satisfies 1 (for < R < C) 

< E MD (R,L) < LE sp {R) , (12) 

1 We note that a tighter lower bound may be obtained by using the expurgated exponent E ex (R) instead of the 
random coding exponent E r {R) at low rates. This observation will be used when generating numerical results. 



E r (R) + (L — 1) 



max 

0<s<P<l,P 



E (s,p,p) - pR- sE r (R) 
l + s(L-2) 
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where E r (R) and E sp (R) denote the random coding and sphere packing exponents of the memoryless 
channel, and E a (s, p,p) is as given in (6). 

Proof: The average decoding delay of memoryless decoding is given by 

vfe=l / 

J2(k + l)N[Pr(X)A - (j2kN[Pr(X)}> 

= N(j2 [Pr(X)A = N [l + o(l)} , (13) 
\k=o J 

where the last equality follows from (3) when E 1 (R 1 , T) > 0. Thus the average effective transmission 
rate is given by 

InM InM 

^~ ~ iV[l + o(l)] ~* Rl ' 
as N — > oo when Ei(Ri,T) > 0. The average probability of error is given by 

L-l 



Pr MD (E) = ^[Pr(X)]^- 1 ) Pr(e) + [Pv(X)}^ Pr (ML) (e) 
k=i 

= Pr(e)[l + o(l)] + [Pr(X)] (L - 1) Pr (ML) (e) (14) 

< e -N[E 1 (R 1 ,T)+T\ [ X + ^^j + e -NlE r (R 1 )+(L~l)Ei(Ri,T)} ? ( 15 ) 

where the inequality follows from (3) and the random coding upper bound on the ML decoding 
error probability [3]. Letting E 1 (R 1 , T) — > and maximizing T as before, we get the following error 
exponent 

E MD (R,L) = limsup - lnPr ^ g ( E ) > m m{E F (R),E r (R)} = E r (R), 

N-^oo T 

since the feedback exponent Ep(R) is known to be greater than the random coding exponent E r (R). 
Thus by setting Ei(R 1 ,T) — > 0, as suggested by intuitive reasoning, we find that memoryless 
decoding does not give any improvement over ML decoding without feedback. However, we can 
get better performance by optimizing the expression in (15) w.r.t T without letting Ei(R±,T) — > 0. 
From (15), it is clear that the optimal value of the threshold T* is the one that yields 

E 1 (R 1 ,T*)+T* = E r (Ri) + (L — l)£' 1 (i2 1 , T*) 

=>- T* = E r (R 1 ) + (L- 2) E 1 (R u T* ) . (16) 
Using this optimal value of T* in (5) and solving for Ei(Ri,T*), we get 

p/ D ™*x fE (s,p,p)-pR 1 -sE r (R 1 )\ 

EARi.T ) = max — — . (17) 

o<s<p<i,p\ l + s(L-2) J y ' 
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Since E F (R 1 ) > E^Ri), we have Ei(R 1: T*) > and hence R — > Ri as N — > oo. Thus the error 
exponent of memoryless decoding is lower bounded by 



E MD (R,L) > E 2 (R,T*) 

= E r (R) + (L - 1) 



E ± (R,T*)+T* = E r (R) + (L - 1)E 1 (R,T 
E (s,p,p) - pR- sE r (R) 



max 

0<s<p<l,p 



l + s(L- 2) 



(18) 



Since E^R, T*) > 0, it is clear that the optimal threshold T* satisfies < T* < E F (R) and thus 
the lower bound on E MD (R,L) in (18) is smaller than the feedback exponent E F (R). 
We now derive an upper bound on Emd(R, L) from (14) as follows. 



Pr M D(E) = Pr(e) [1 + (1)] + [Pr{X)]^ P^ ML \s) 
> [Pr(X)]( L - 1 ) Pr( ML )(e) 



> [Pr(X)] (L - 1} (e 



-NE sp (Ri) 



) • 



(19) 



where the last inequality follows from the sphere-packing lower bound on the ML decoding error 
probability [3]. It is easy to see that the probability of erasure Pr(X) of the decoder in (1) decreases 
when the threshold parameter T is decreased. Thus the probability of erasure Pr(X)|T =0 serves as 
a lower bound on Pr(X) for any T > 0. In [5], upper and lower bounds on the erasure and error 
probabilities are derived using a theorem of Shannon et al [6]. From equations (10) and (11) in [5], 
we have 



1 

AM 
and 



M 

J^exp 

m=l 



M 



fi m {s) - sn' m (s) - s^2^Js)\ < Pr(X) + Pr(e) < — ^ exp p m (s) - sp m (s) 



m=l 



1 M 

— ^ exp (i m (s) + (1 - s)fi' m (s) - (1 - s) v / 2)C(s)] < Pr (^) 



m=l 



< 



1 M 

— ^ exp fi m (s) + {l-s)fi' m (s)^ , 



m=l 



where 



In J p(y|x m ) 



(1-s) 



dy . 



It is clear from equation (8) in [5] that the threshold parameter T is related to the parameter p m (s) 
by p m (s) = —NT. Thus the condition T = corresponds to the condition pL m (s) = 0. Moreover, it 
is shown in [5] that p m (s) and fi m (s) are also proportional to N. Using this fact and the condition 



in the above expressions for the upper and lower bounds on Pr(X) and Pr(e), we get 



1 M 
M 



AM 



exp 



m=l 



l+O 



1 M 

< Pr(X)+Pr(e) < — £ exp [^(s)] , (20) 



m=l 
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and 

1 M 

< Pr(e) < — ex P • ( 21 ) 

m=l 

It is clear from (20) and (21) that when T = 0, the exponents of the upper and lower bounds 
coincide as N — > oo, and more importantly, the exponent of the erasure probability Pr(X) is the 
same as that of the error probability Pr(e). These exponents are further equal to the exponent of 
the ML decoding error probability since Pr(e) < Pr (Mi) (e) < Pr( e) + Pr(X). Using this fact and 
the sphere-packing lower bound on the ML decoding error probability in (19), we get 

Pr M D(E) > e ~ NLE ^ E MD (R,L) < LE sp (R) , 

since R — > R\ as N — > oo. □ 



— E 

AM ^ 



cxp 



l^m(s) (l + O 



™ 1 



From Theorem 2, it is clear that ARQ with memoryless decoding does not achieve Forney's error 
exponent Ep(R) when the maximum number of ARQ rounds L is constrained, at least at high rates 
for which LE sp (R) < Ep(R). As expected, when L — > oo, the lower bound on the error exponent 
in (12) becomes 

UmE MD (R,L) > max f E <>(s, P,p) - pR \ = E , R y 

L^oo 0<s<p<l,p \_ S ) 

3.2 Incremental Redundancy ARQ 

We now derive a lower bound on the error exponent of incremental redundancy ARQ. In IR-ARQ, 
the transmitter, upon receiving a NACK message, transmits iV new coded symbols (derived from 
the same message). Since our results hinge on random coding arguments, these new symbols are 
obtained as i.i.d. realizations from the channel capacity achieving distribution. The decoder does 
not discard the received observations in the case of an erasure and uses the received sequences of 
all the ARQ rounds jointly to decode the transmitted message. The following erasure decoding 
rule is employed by the receiver: After the k th ARQ round, the decoder decides on codeword x m iff 
y G li m , where 

74 - f : iffeb £ e ' Mn } • (22) 

and y, {xj} are vectors of length kN, which contain the received sequences and transmitted code- 
words (respectively) corresponding to the k ARQ rounds. If no codeword satisfies the above condi- 
tion, then an erasure is declared by the decoder. It is clear that our formulation allows for varying 
the threshold T k as a function of the number of ARQ rounds k. Using thresholds {T k } that de- 
crease with the number of ARQ rounds k makes intuitive sense since the probability of error will be 
dominated by small values of k (initial ARQ rounds), and hence, one needs to use higher thresholds 
for these k values to reduce the overall probability of error. We let E k denote the event that the 
decoder declares an erasure during all the first k ARQ rounds. We also let E = (the empty set). 
The probability of erasure and error of the decoder in the k th ARQ round will thus be denoted by 
Pr( fc ) (AT | £(£_!)) and Pr( fc )(e|E( fc _ 1 )), respectively. Here the subscript (k) is used to highlight the fact 
that the decoder uses a received sequence of length kN for decoding in the k th ARQ round. We are 
now ready to state our main result in this section. 
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Theorem 3 The error exponent E IR (R, L) achieved by IR-ARQ, under a deadline constraint L, is 
given by 2 

E IR (R,L) > mm{E F (R) , LE r (R/L)} , 0<R<C. 



(23) 



Proof: The average decoding delay for IR-ARQ is given by 

L 



r = kN Pr (Transmission stops after k ARQ rounds) 
k=i 

L-l /k-1 \ 

+ LN (nPr {i) (X\E {i _ 1} ) 
^(k + m^JPr^XlE^)) - J2 kN [H P *d)( X \ E > 

L-l / k 



fe=i 



L-l 



fc=0 

AT 



L-l 



fc=l 



< N 



k=l \i=l 
L-l 



l + ^Pr(X) 



fc=i 



< A r [1 + LPr(X)] . 



(24) 



Since Pr(X) < e (-Ri > T ) ; it follows that r ^ N (and hence the average effective transmission 
rate R — > as iV — > oo when Ei(Ri, T) > 0. The average probability of error of IR-ARQ is given 
by 

L 

Pr /i? (E) = ^ Pr(error in the k th ARQ round) 

k=l 
L-l 

= £ Pr (fc) (e, S (fc _D) + Pr^ L) (e,£; {L _ 1) ) 
fc=i 

< X> w (e) + p^ L) (^)> 



fe=i 



where Pr^e) refers to the probability of error when the decoder always waits for kN received 
symbols before decoding. Following the derivation in [2], it can easily be seen that for the thresholds 
{T k } used in the decoding rule (22), we have 



PWX) < e - kNEl{Rl/k ^ and Pr (fc) (e) < e 



-fcAf[L;i(Ri/fc,T fc )+T fe ] 



(25) 



2 Replacing the random coding exponent E r (R) by the expurgated exponent E ex (R) may yield a tighter lower 
bound at low rates. 
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Using this and the fact that Pt^ L \e) < e LNE r(Ri/L)^ ftie average probability of error of IR-ARQ 
can be upper bounded by 



L-l 



Pt ir (E) < e -fcjy[fii(fli/fc,r fc )+r fc ] + g—LNE r (Ri/L) _ ^6) 



k=i 

Thus the error exponent achieved by IR-ARQ is lower bounded by 

E IR (R,L) = limsup - lnPr ^ (E) > m in (LE r (RlL), { k\E^{R/k,T k ) + T k ]} L k Zl) ■ 
N^oo r \ / 

Taking T k = (T/k), VA; e {1, • • • , (L - 1)}, we get 

E m (R,L) > min ^LE r (R/L), { kE x {R/k,T/k) + T j^ 1 ) 

= min(L£ r (i2/L), £i T) + T) , 

where the last equality follows from the fact that Ei(R/k,T/k) is an increasing function of k. 
Letting Ei(R,T) — > and maximizing T, we get 

E IR {R,L) > min (LE r (R/L), E F (R)) . 

□ 

From Theorem 3, it is clear that if the deadline constraint L is large enough to satisfy 

LE r (R/L) > E F (R) , (27) 

then IR-ARQ achieves the feedback exponent Ep(R) at rate R. In the following section, we quantify 
the gains achieved by IR-ARQ, as compared with memoryless decoding, for specific channels. 

4 Examples 

4.1 The Binary Symmetric Channel (BSC) 

Here, we compare the error exponents achievable by memoryless decoding and IR-ARQ over a BSC 
with crossover probability e. The bounds on the error exponents in (12) and (23) are plotted for 
a BSC with e = 0.15 in Figs. 1(a) and 1(b) for L = 2 and L — 4, respectively. The ML decoding 
error exponent (corresponding to the case L — 1) and the feedback exponent Ep{R) are also plotted 
for comparison purposes. From Fig. 1(a), we find that when L = 2, memoryless decoding achieves 
an error exponent that is strictly sub-optimal to the feedback exponent E R (R) for all R > 0.006. 
On the other hand, IR-ARQ achieves Ep(R) for 0.18 < R < C . Moreover, it performs strictly 
better than memoryless decoding for all R > 0.057. When L — 4, from Fig. 1(b), we find that the 
error exponent for the memoryless decoder is strictly sub-optimal, as compared with Ep(R), for 
R > 0.141, while IR-ARQ achieves E F (R) for all rates below capacity. Finally, we note that even 
when L = 100, memoryless decoding is still strictly sub-optimal, as compared with IR-ARQ, for all 
rates 0.38 < R < C = 0.39. 
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Now, we elaborate on our observation from Fig. 1(b) that L = 4 is sufficient to achieve E F {R) 
with 1R-ARQ when e = 0.15. In particular, we wish to investigate the existence of a finite value 
for L such that E F (R) is achieved by IR-ARQ universally (i.e., for all < e < 0.5 and all rates 
below capacity). Towards this end, we derive an upper bound on the minimum required deadline 
constraint L req for a given BSC(e). From (23), it is clear that L req is upper bounded by the minimum 
value of L required to satisfy LE r (R/L) > E F (R) for all < R < C. We first prove the following 
result. 

Lemma 4 A sufficient condition for ensuring that LE r (R/L) > E F (R) for all rates < R < C 
for a BSC is given by LE r (0) > E F (0). 

Proof: It has been shown in [3] that both the random coding exponent E r (R) and the feedback 
exponent E F (R) are decreasing functions of R. Since 

LE r (R/L) = max {LE (p) - pR} , (28) 

0<p<l 

its slope at a given rate R is given by (following the steps in equations (5.6.28-5.6.33) in [3]) 

d{LE r {R/L)) 

dR " -PW* - 1 ' 

where p*(R) is the value of p that maximizes the RHS of (28) for rate R. For a BSC, it is shown 
in [2] that the feedback exponent can be expressed as 

E F (R) = (C-R) + max{£ (p) - pR} . (29) 
Hence the slope of E F (R) at a given rate R is given by 

where p (R) is the value of p that maximizes the RHS of (29) for rate R. Hence it is clear that 
for any value of R, the rate of decrease of the feedback exponent E F (R) is higher than that of 
LE r (R/ L). It is shown in [2] that E F (C) = E r (C) = 0. Since E r (R) is a decreasing function of 
R, we know that E r (C/L) > E r (C) = 0. Thus, when R = C,we have LE r (C/L) > E F (C). Now, 
if the value of L is chosen such that LE r (0) > E F (0), it is clear that the curve LE r (R/ L) lies 
strictly above the curve E F (R) in the range < R < C. This directly follows from the fact that the 
feedback exponent E F (R) decreases faster than LE r (R/L). Hence the condition LE r (0) > E F (0) 
is sufficient to guarantee that LE r (R/L) > E F (R) for all < R < C. □ 

The above lemma shows that for any BSC(e), an upper bound on L req depends only on the 
values of E F (R) and E r (R) at R — 0. From the results in [3], it can be shown that 

E r (0) = ln2-ln(l + 2v/ e (l-e)) and E F (0) = C-ln2-ln ^e(l - e)) . (30) 

Using Lemma 4 and (30), we find that a deadline constraint of L = 4 is enough to achieve the 
feedback exponent E F (R) at all rates below capacity for any BSC with crossover probability 0.05 < 
e < 0.5. However, the upper bound on L req , derived using Lemma 4, becomes loose as e — > 0. To 
overcome this limitation, we use the expurgated exponent E ex (R) [3] instead of the random coding 
exponent E r (R) at low rates. Using numerical results, we find that the actual value of the minimum 
required deadline constraint is L req = 3 for all BSCs with e < 0.025, and L req = 4 otherwise. 
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4.2 The Very Noisy Channel (VNC) 

As noted in [3], a channel is very noisy when the probability of receiving a given output is almost 
independent of the input, i.e., when the transition probabilities of the channel are given by 

p jk = + e jk ) , 

where {ujj} denotes the output probability distribution, and {ejk} are such that \ejk\ <C 1 for all j 
and k, and ^j^jtjk — 0, V7c. We plot the bounds on the error exponents given in (12) and (23), 
derived from the results in [2], in Figs. 2(a) and 2(b) for a VNC with capacity C = 1 for L = 2 and 
L = 4 respectively. From the plots, it is clear that memoryless decoding is strictly sub-optimal to 
IR-ARQ for all rates R > 0.12 (with L = 2) and R > 0.25 (with L = 4). Moreover, it is evident 
that L = 4 is sufficient for IR-ARQ to achieve the feedback exponent E F (R) for all rates below 
capacity. This observation motivates the following result. 

Lemma 5 For the very noisy channel, a deadline constraint of L = 4 is enough for the proposed 
incremental redundancy scheme to achieve the feedback exponent E F (R) for all rates < R < C . 

Proof: For a VNC, the random coding exponent is given by [2] 

< R < % 

E r (R) = I f )1 ~ ~ 4 . (31) 

Thus, under the deadline constraint L 

AE r {R/4) = 4(^--^) = 2C-R, 0<R<C. 




Also 

E F (R) = (C-R) + (VC - VR) 2 < (C-R) + (VC) 2 = AE r (R/A) . 
Putting L = 4 in (23), the error exponent of IR-ARQ is given by 

E IR (R,A) > mm{E F (R) , AE r (R/A)} = E F (R) . (32) 

Thus, for a VNC, it is clear that a deadline constraint of L = 4 is enough for IR-ARQ to achieve 
the feedback exponent E F (R) at all rates below capacity. □ 



4.3 The Additive White Gaussian Noise (AWGN) channel 

The random coding and expurgated exponents for an AWGN channel with a Gaussian input of 
power A and unit noise variance, are given in [3]. The sphere-packing exponent of the AWGN 
channel is derived in [7-9]. The parameter E Q (s,p,p) in the lower bound in (12) is replaced by 
E Q (s, p, t) which, following the steps in the derivation of the random coding exponent in [3], is given 
by 

E (s,p,t) = (l + p )M+Q)log(l-2M)+(01og(l-2L4+y) 



ii 



The feedback exponent for the AWGN channel is then given by [2, 3] 



E F (R) 



max 

0<s<p<M>0 



( 



E a (s,p,t) - pR 



s 



) 



We plot the bounds on the error exponents, given in (12) and (23), in Figs. 3(a) and 3(b) for an 
AWGN channel with signal-to-noise ratio A = 3 dB for the deadline constraints L = 2 and L = 4 
respectively. The plots clearly indicate that memoryless decoding is strictly sub-optimal to IR-ARQ 
for all rates R > 0.19 (with L = 2) and R > 0.46 (with L — 4). Moreover, when L — 4, the proposed 
IR-ARQ scheme achieves the feedback exponent E F (R) for all rates below capacity (at the moment, 
we do not have a proof that this observation holds universally as in the case of BSCs). 

5 Conclusions 

We considered the error exponents of memoryless ARQ channels with an upper bound L on the 
maximum number of re-transmission rounds. In this setup, we have established the superiority of 
IR-ARQ, as compared with Forney's memoryless decoding. For the BSC and VNC, our results show 
that choosing L = 4 is sufficient to ensure the achievability of Forney's feedback exponent, which 
is typically achievable with memoryless decoding in the asymptotic limit of large L. Finally, in the 
AWGN channel, numerical results also show the superiority of IR-ARQ over memoryless decoding, 
in terms of the achievable error exponent. 
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Figure 1(a): Comparison of the error exponents for a BSC with e = 0.15 and L = 2 
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Figure 1(b): Comparison of the error exponents for a BSC with e = 0.15 and L = 4 
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Figure 2(a): Comparison of the error exponents for a VNC with C — 1 and L = 2 
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Figure 2(b): Comparison of the error exponents for a VNC with C = 1 and L = 4 
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Figure 3(a): Comparison of error exponents for an AWGN channel with SNR = 3 dB and L = 2 
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Figure 3(b): Comparison of error exponents for an AWGN channel with SNR = 3 dB and L = 4 
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